On the usefulness of STFT phase spectrum in human listening tests

Authors

  • Kuldip K. Paliwal
  • Leigh D. Alsteris
Abstract

The short-time Fourier transform (STFT) of a speech signal has two components: the magnitude spectrum and the phase spectrum. In this paper, the relative importance of short-time magnitude and phase spectra for speech perception is investigated. Human perception experiments are conducted to measure the intelligibility of speech stimuli synthesized either from magnitude spectra or from phase spectra. It is traditionally believed that the magnitude spectrum plays a dominant role for small window durations (20–40 ms), while the phase spectrum is more important for large window durations (> 1 s). It is shown in this paper that even for small window durations, the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis–modification–synthesis parameters are properly selected. © 2004 Elsevier B.V. All rights reserved.
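The analysis–modification–synthesis idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Hamming window, the 32 ms frame length, the hop size, the synthetic two-tone test signal standing in for speech, and the choice to build a magnitude-only stimulus by randomizing the phase and a phase-only stimulus by setting every magnitude to one. The helper names (`stft`, `istft`, `magnitude_only`, `phase_only`) are hypothetical.

```python
import numpy as np


def stft(x, frame_len, hop):
    """Return the complex STFT (frames x bins) and the analysis window."""
    window = np.hamming(frame_len)  # assumed window; other choices are possible
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1), window


def istft(spec, window, hop, length):
    """Weighted overlap-add synthesis, normalized by the summed squared window."""
    frame_len = len(window)
    frames = np.fft.irfft(spec, n=frame_len, axis=1)
    out = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        start = i * hop
        out[start:start + frame_len] += frame * window
        norm[start:start + frame_len] += window ** 2
    return out / np.maximum(norm, 1e-8)


def magnitude_only(spec, rng):
    """Keep each magnitude, replace the phase with uniform random values."""
    random_phase = rng.uniform(-np.pi, np.pi, size=spec.shape)
    return np.abs(spec) * np.exp(1j * random_phase)


def phase_only(spec):
    """Keep each phase, set every magnitude to one."""
    return np.exp(1j * np.angle(spec))


# Illustrative parameters: 16 kHz sampling, 32 ms frames, 4 ms hop (all assumed).
fs, frame_len, hop = 16000, 512, 64
t = np.arange(fs) / fs
# Synthetic stand-in for a speech signal (one second, two sinusoids).
x = 0.5 * np.sin(2 * np.pi * 220 * t) + 0.3 * np.sin(2 * np.pi * 1300 * t)

spec, window = stft(x, frame_len, hop)
rng = np.random.default_rng(0)

x_rec = istft(spec, window, hop, len(x))                         # unmodified
x_mag = istft(magnitude_only(spec, rng), window, hop, len(x))    # magnitude-only
x_phase = istft(phase_only(spec), window, hop, len(x))           # phase-only
```

With an unmodified spectrum the synthesis step returns the input signal, which is a useful sanity check before comparing the two modified stimuli. The window type, frame length, and hop are the kind of analysis–modification–synthesis parameters the abstract refers to, so they are exposed here rather than hard-coded inside the functions.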

Similar resources

Speech Enhancement in Hearing Aids Using Conjugate Symmetry of DFT and SNR-Perception Models

Most speech enhancement algorithms operate on the STFT magnitude while the phase is kept unchanged [3]. In this paper, the magnitude of the STFT of noisy speech is kept unchanged while the phase is modified. The modified complex spectrum of speech is obtained by combining the unchanged magnitude spectrum with the modified phase spectrum. This modification results in cancellation of low-energy components (noise)...

ISAR Image Improvement Using STFT Kernel Width Optimization Based On Minimum Entropy Criterion

Radar systems now have many applications, and radar imaging is one of the most important. Inverse Synthetic Aperture Radar (ISAR) is used to form images of moving targets. Conventional methods use the Fourier transform to retrieve Doppler information. However, because the target maneuvers, the Doppler spectrum becomes time-varying and the image is blurred. Joi...

Further intelligibility results from human listening tests using the short-time phase spectrum

State-of-the-art automatic speech recognition systems (ASRs) use only the short-time magnitude spectrum for feature extraction; the short-time phase spectrum is generally ignored in these systems. Results from our recent human listening tests indicate that the short-time phase spectrum can significantly contribute to speech intelligibility over small window durations (i.e., 20–40 ms). This is a...

Phase Processing for Single-Channel Speech Enhancement: History and recent advances (Timo Gerkmann, Martin Krawczyk-Becker, and Jonathan Le Roux)

Date of publication: 12 February 2015. With the advancement of technology, both assisted listening devices and speech communication devices are becoming more portable and also more frequently used. As a consequence, users of devices such as hearing aids, cochlear implants, and mobile telephones expect their devices to work robustly anywhere and at any time. This holds in particular for challengi...

Test Method Facet and the Construct Validity of Listening Comprehension Tests

The assessment of listening abilities is one of the least understood, least developed and, yet, one of the most important areas of language testing and assessment. It is particularly important because of its potential wash-back effects on classroom practices. Given the fact that listening tests play a great role in assessing the language proficiency of students, they are expected to enjoy a hig...

Journal title:
  • Speech Communication

Volume: 45

Publication date: 2005